48 research outputs found

    Towards a Cloud-Based Service for Maintaining and Analyzing Data About Scientific Events

    Full text link
    We propose the new cloud-based service OpenResearch for managing and analyzing data about scientific events such as conferences and workshops in a persistent and reliable way. This includes data about scientific articles, participants, acceptance rates, submission numbers, impact values as well as organizational details such as program committees, chairs, fees and sponsors. OpenResearch is a centralized repository for scientific events and supports researchers in collecting, organizing, sharing and disseminating information about scientific events in a structured way. An additional feature currently under development is the possibility to archive web pages along with the extracted semantic data in order to lift the burden of maintaining new and old conference web sites from public research institutions. However, the main advantage is that this cloud-based repository enables a comprehensive analysis of conference data. Based on extracted semantic data, it is possible to determine quality estimations, scientific communities, research trends as well the development of acceptance rates, fees, and number of participants in a continuous way complemented by projections into the future. Furthermore, data about research articles can be systematically explored using a content-based analysis as well as citation linkage. All data maintained in this crowd-sourcing platform is made freely available through an open SPARQL endpoint, which allows for analytical queries in a flexible and user-defined way.Comment: A completed version of this paper had been accepted in SAVE-SD workshop 2017 at WWW conferenc

    Mapping Large Scale Research Metadata to Linked Data: A Performance Comparison of HBase, CSV and XML

    Full text link
    OpenAIRE, the Open Access Infrastructure for Research in Europe, comprises a database of all EC FP7 and H2020 funded research projects, including metadata of their results (publications and datasets). These data are stored in an HBase NoSQL database, post-processed, and exposed as HTML for human consumption, and as XML through a web service interface. As an intermediate format to facilitate statistical computations, CSV is generated internally. To interlink the OpenAIRE data with related data on the Web, we aim at exporting them as Linked Open Data (LOD). The LOD export is required to integrate into the overall data processing workflow, where derived data are regenerated from the base data every day. We thus faced the challenge of identifying the best-performing conversion approach.We evaluated the performances of creating LOD by a MapReduce job on top of HBase, by mapping the intermediate CSV files, and by mapping the XML output.Comment: Accepted in 0th Metadata and Semantics Research Conferenc

    Una revisión de la investigación de Global Reporting Initiative (GRI) con informes de sostenibilidad: conjunto de datos 1999-2020

    Get PDF
    This review paper aims to identify the main areas of studies in the field of the Global Reporting Initiative (GRI) research with sustainability reports. Using a bibliometric analysis, this study evaluated 955 published documents retrieved from the Scopus database to find a research review structure on GRI with sustainability topics from 1999 to 2020 by utilizing the bibliometric package in VOSviewer and Harzing’s Publish or Perish. This paper examined the most effective journals, authors, countries, institutions, subject area, keywords, citation, co-authorship, co-citation, bibliographic coupling, and co-occurrences networks. Also, this paper demonstrated the intellectual structure of the research and perceived obstacles to growth in the literature. The results show that the trend of publications has been growing over the past 20 years. This study offers a comprehensive understanding and publication of past studies trends and suggests that it will be a much greater number of articles in this field over the next decade which help the future direction of researchers in this area.Este trabajo tiene como objetivo identificar las principales áreas de estudio en la investigación de Global Reporting Initiative (GRI). Mediante un análisis bibliométrico, este estudio evaluó 955 documentos publicados recuperados de la base de datos Scopus para encontrar una estructura de revisión de la investigación sobre GRI con temas de sostenibilidad desde 1999 hasta 2020 utilizando el paquete bibliométrico de VOSviewer y Publish or Perish de Harzing. Este trabajo examinó las revistas, autores, países, instituciones, área temática, palabras clave, citación, coautoría, co-citación, acoplamiento bibliográfico y redes de co-ocurrencias más eficaces. Asimismo, este trabajo demostró la estructura intelectual de la investigación y los obstáculos percibidos para el crecimiento de la bibliografía. Los resultados muestran que en los últimos 20 años la tendencia de las publicaciones ha ido en aumento. Este estudio ofrece una comprensión global y la publicación de las tendencias de los estudios anteriores y sugiere que habrá un número mucho mayor de artículos en este campo durante la próxima década

    Scholarly event characteristics in four fields of science : a metrics-based analysis

    Get PDF
    One of the key channels of scholarly knowledge exchange are scholarly events such as conferences, workshops, symposiums, etc.; such events are especially important and popular in Computer Science, Engineering, and Natural Sciences.However, scholars encounter problems in finding relevant information about upcoming events and statistics on their historic evolution.In order to obtain a better understanding of scholarly event characteristics in four fields of science, we analyzed the metadata of scholarly events of four major fields of science, namely Computer Science, Physics, Engineering, and Mathematics using Scholarly Events Quality Assessment suite, a suite of ten metrics.In particular, we analyzed renowned scholarly events belonging to five sub-fields within Computer Science, namely World Wide Web, Computer Vision, Software Engineering, Data Management, as well as Security and Privacy.This analysis is based on a systematic approach using descriptive statistics as well as exploratory data analysis. The findings are on the one hand interesting to observe the general evolution and success factors of scholarly events; on the other hand, they allow (prospective) event organizers, publishers, and committee members to assess the progress of their event over time and compare it to other events in the same field; and finally, they help researchers to make more informed decisions when selecting suitable venues for presenting their work.Based on these findings, a set of recommendations has been concluded to different stakeholders, involving event organizers, potential authors, proceedings publishers, and sponsors. Our comprehensive dataset of scholarly events of the aforementioned fields is openly available in a semantic format and maintained collaboratively at OpenResearch.org. © 2020, The Author(s)

    5* Knowledge Graph Embeddings with Projective Transformations

    Full text link
    Performing link prediction using knowledge graph embedding (KGE) models is a popular approach for knowledge graph completion. Such link predictions are performed by measuring the likelihood of links in the graph via a transformation function that maps nodes via edges into a vector space. Since the complex structure of the real world is reflected in multi-relational knowledge graphs, the transformation functions need to be able to represent this complexity. However, most of the existing transformation functions in embedding models have been designed in Euclidean geometry and only cover one or two simple transformations. Therefore, they are prone to underfitting and limited in their ability to embed complex graph structures. The area of projective geometry, however, fully covers inversion, reflection, translation, rotation, and homothety transformations. We propose a novel KGE model, which supports those transformations and subsumes other state-of-the-art models. The model has several favorable theoretical properties and outperforms existing approaches on widely used link prediction benchmarks
    corecore